Sequential Inductive Learning
نویسنده
چکیده
In this paper I advocate a new model for inductive learning. Called sequential induction, this model bridges classical fixed-sample learning techniques (which are efficient but ad hoc), and worst-case approaches (which provide strong statistical guarantees but are too inefficient for practical use). According to the sequential inductive model, learning is a sequence of decisions which are informed by training data. By analyzing induction at the level of these decisions, and by utilizing the minimum data necessary to make each decision, sequential inductive techniques can provide the strong statistical guarantees of worst-case methods, but with substantially less data than those methods require. The sequential inductive model is also useful as a method for determining a sufficient sample size for inductive learning and as such, is relevant to megainduction,where the preponderance of data introduces problems of scale. The peepholing and decision-theoretic subsampling approaches of Catlett, Musick, and Russell, which address problems of scale in inductive learning, have informed my work. The technique that I have developed generalizes and extends these prior approaches to megainduction. _______________________ This paper has not already been accepted by and is not currently under review for a journal or other conference. Nor will it be submitted for such during IJCAI’s review period. This work is supported by the National Science Foundation under Grant NSF–IRI–92-09394.
منابع مشابه
The Task Rehearsal Method of Sequential Learning
An hypothesis of functional transfer of task knowledge is presented that requires the development of a measure of task relatedness and a method of sequential learning. The task rehearsal method (TRM) is introduced to address the issues of sequential learning, namely retention and transfer of knowledge. TRM is a knowledge based inductive learning system that uses functional domain knowledge as a...
متن کاملInductive Learning in Less Than One Sequential Data Scan
Most recent research of scalable inductive learning on very large dataset, decision tree construction in particular, focuses on eliminating memory constraints and reducing the number of sequential data scans. However, state-of-the-art decision tree construction algorithms still require multiple scans over the data set and use sophisticated control mechanisms and data structures. We first discus...
متن کاملScaling Up Inductive Learning with MassiveParallelismFOSTER
Machine learning programs need to scale up to very large data sets for several reasons, including increasing accuracy and discovering infrequent special cases. Current inductive learners perform well with hundreds or thousands of training examples, but in some cases, up to a million or more examples may be necessary to learn important special cases with conndence. These tasks are infeasible for...
متن کاملInductive Logic Programming Used to Discover Topological Constraints in Protein Structures
This paper describes the application of the Inductive Logic Programming (ILP) program GOLEM to the discovery of constraints in the packing of beta-sheets in alpha/beta proteins. These constraints (rules) have a role in understanding the protein folding problem. Constraints were learnt for four features of beta-sheet packing: the winding direction of two sequential strands, whether two consecuti...
متن کاملParallel Induction Systems based on Branch and Bound
Symbolic inductive learning systems explore a space of hypotheses to derive the definition of concepts. Due to the size and complexity of this space, sequential algorithms use various heuristics that limit the classification accuracy of the generated definitions. Parallel search algorithm can rectify such limitations. In this paper we present the PARIS parallel symbolic inductive system. PARIS ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1996